Database development in toxicogenomics: issues and efforts.
نویسندگان
چکیده
The marriage of toxicology and genomics has created not only opportunities but also novel informatics challenges. As with the larger field of gene expression analysis, toxicogenomics faces the problems of probe annotation and data comparison across different array platforms. Toxicogenomics studies are generally built on standard toxicology studies generating biological end point data, and as such, one goal of toxicogenomics is to detect relationships between changes in gene expression and in those biological parameters. These challenges are best addressed through data collection into a well-designed toxicogenomics database. A successful publicly accessible toxicogenomics database will serve as a repository for data sharing and as a resource for analysis, data mining, and discussion. It will offer a vehicle for harmonizing nomenclature and analytical approaches and serve as a reference for regulatory organizations to evaluate toxicogenomics data submitted as part of registrations. Such a database would capture the experimental context of in vivo studies with great fidelity such that the dynamics of the dose response could be probed statistically with confidence. This review presents the collaborative efforts between the European Molecular Biology Laboratory-European Bioinformatics Institute ArrayExpress, the International Life Sciences Institute Health and Environmental Science Institute, and the National Institute of Environmental Health Sciences National Center for Toxigenomics Chemical Effects in Biological Systems knowledge base. The goal of this collaboration is to establish public infrastructure on an international scale and examine other developments aimed at establishing toxicogenomics databases. In this review we discuss several issues common to such databases: the requirement for identifying minimal descriptors to represent the experiment, the demand for standardizing data storage and exchange formats, the challenge of creating standardized nomenclature and ontologies to describe biological data, the technical problems involved in data upload, the necessity of defining parameters that assess and record data quality, and the development of standardized analytical approaches.
منابع مشابه
Recent progress in toxicogenomics research in South Korea
BACKGROUND The importance of toxicogenomics was recognized early in Korea and a group of researchers was trying to build up a research infrastructure and educational system. However, since the scale of the Korean pharmaceutical industry, which was expected to play the key role in toxicogenomics was small compared to that of advanced countries, industry-sponsored large-scale research projects an...
متن کاملWeb services-based text-mining demonstrates broad impacts for interoperability and process simplification
The Critical Assessment of Information Extraction systems in Biology (BioCreAtIvE) challenge evaluation tasks collectively represent a community-wide effort to evaluate a variety of text-mining and information extraction systems applied to the biological domain. The BioCreative IV Workshop included five independent subject areas, including Track 3, which focused on named-entity recognition (NER...
متن کاملThe Disease Portals, disease-gene annotation and the RGD disease ontology at the Rat Genome Database
The Rat Genome Database (RGD;http://rgd.mcw.edu/) provides critical datasets and software tools to a diverse community of rat and non-rat researchers worldwide. To meet the needs of the many users whose research is disease oriented, RGD has created a series of Disease Portals and has prioritized its curation efforts on the datasets important to understanding the mechanisms of various diseases. ...
متن کاملClearing the standards landscape: the semantics of terminology and their impact on toxicogenomics.
The emergence of the microarray data standards, especially the Minimum Information About a Microarray Experiment (MIAME), has spurred several organizations to develop their own standards for a myriad of technologies, including proteomics and metabolomics. These efforts have facilitated the creation of several large-scale gene expression repositories, including the toxicology-focused Chemical Ef...
متن کاملFORUM Clearing the Standards Landscape: the Semantics of Terminology and their Impact on Toxicogenomics
The emergence of the microarray data standards, especially the Minimum Information About a Microarray Experiment (MIAME), has spurred several organizations to develop their own standards for a myriad of technologies, including proteomics and metabolomics. These efforts have facilitated the creation of several largescale gene expression repositories, including the toxicology-focused Chemical Eff...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Environmental Health Perspectives
دوره 112 شماره
صفحات -
تاریخ انتشار 2004